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Genes Encoding Epsilon Lycopene Cyclase and Method 
for Producing Bicyclic Epsilon Carotene 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention describes the DNA sequence for eukaryotic genes encoding e 
lycopene cyclase as well as vectors containing the same and hosts transformed with these 
vectors. The present invention also provides a method for augmenting the accumulation of 
carotenoids and production of novel and rare carotenoids. The present invention provides 
methods for controlling the ratio of various carotenoids in a host. Additionally, the present 
invention provides a method for screening for eukaryotic genes encoding enzymes of 
carotenoid biosynthesis and metabolism. The invention also provides transgenic plants 
having therapeutic properties, methods for preparing a therapeutic composition, and 
methods for treating disease by administering the therapeutic plants and compositions. 

Discussion of the Background 

Carotenoid pigments with cyclic endgroups are essential components of the 
photosynthetic apparatus in oxygenic photosynthetic organisms (e.g., cyanobacteria, algae 
and plants; Goodwin, 1980). The symmetrical bicyclic yellow carotenoid pigment 0- 
carotene (or, in rare cases, the asymmetrical bicyclic a-carotene) is intimately associated 
with the photosynthetic reaction centers and plays a vital role in protecting against 
potentially lethal photooxidative damage (Koyama, 1991). p-carotene and other carotenoids 
derived from it or from a-carotene also serve as light-harvesting pigments (Siefermann- 
Harms, 1987), are involved in the thermal dissipation of excess light energy captured by the 
light-harvesting antenna (Demmig-Adams & Adams, 1992), provide substrate for the 
biosynthesis of the plant growth regulator abscisic acid (Rock & Zeevaart, 1991; Parry & 
Horgan, 1991), and are precursors of vitamin A in human and animal diets (Krinsky, 1987). 
Plants also exploit carotenoids as coloring agents in flowers and fruits to attract pollinators 
and agents of seed dispersal (Goodwin, 1980). The color provided by carotenoids is also of 
agronomic value in a number of important crops. Carotenoids are currently harvested from 
plants for use as pigments in food and feed. 
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Two types of cyclic endgroups are commonly found in higher plant carotenoids, 
these are referred to as the P and e cyclic endgroups (Fig. 2; the acyclic endgroup is 
referred to as the Y or psi endgroup). These cyclic endgroups differ only in the position of 
the double bond in the ring. Carotenoids with two P rings are ubiquitous, and those with 
one P and one € ring are common, but carotenoids with two e rings are found in significant 
amounts in relatively few plants. P-Carotene (Fig. 1) has two P endgroups and is a 
symmetrical compound that is the precursor of a number of other important plant 
carotenoids such as zeaxanthin and violaxanthin (Fig. 1). 

Carotenoid enzymes have previously been isolated from a variety of sources 
including bacteria (Armstrong et al., 1989, Mol. Gen. Genet. 216, 254-268; Misawa et al., 
1990, J. Bacterid., 172, 6704-12), fungi (Schmidhauser et al., 1990, Mol. Cell. Biol. 10, 
5064-70), cyanobacteria (Chamovitz et al., 1990, Z. Naturforsch, 45c, 482-86) and higher 
plants (Bartley et al., Proc. Natl. Acad. Sci USA 88, 6532-36; Martinez-Ferez & Vioque, 
1992, Plant Mol. Biol. 18, 981-83). Many of the isolated enzymes show a great diversity in 
function and inhibitory properties between sources. For example, phytoene desaturases 
from Synechococcus and higher plants carry out a two-step desaturation to yield C-carotene 
as a reaction product; whereas the same enzyme from Erwinia introduces four double bonds 
forming lycopene. Similarity of the amino acid sequences are very low for bacterial versus 
plant enzymes. Therefore, even with a gene in hand from one source, it is difficult to 
screen for a gene with similar function in another source. In particular, the sequence 
similarity between bacterial/fungal and cyanobacterial/plants genes is quite low. 

The difficulties in isolating related genes is exemplified by recent efforts to isolated 
the enzyme which catalyzes the formation of P-carotene from the acyclic precursor 
lycopene. Although this enzyme had been isolated in a bacterium, prior to the invention 
described in U.S. Serial No. 08/142, 195 (which is hereby incorporated by reference in its 
entirety), it had not been isolated from any photosynthetic organism nor had the 
corresponding genes been identified and sequenced or the cofactor requirements established. 
The isolation and characterization of the enzyme catalyzing formation of P-carotene in the 
cyanobacterium Synechococcus PCC7942 was described by Cunningham et al. in 1993 and 
1994. 

The P-cyclase of Arabidopsis adds two rings to the symmetrical lycopene to form the 
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bicyclic p-carotene, but the related e -cyclase of Arabidopsis, which has 36% identity for the 
predicted amino acid sequences) adds only a single ring to form the monocyclic 6-carotene 
(Cunningham et al, 1996, Plant Cell 8:1613-1626; U.S. Application No. 08/624,125 filed 
March 29, 1996, which is incorporated by reference herein in its entirety). These 
differences in function provide a simple mechanism for adjusting the proportions of p,p-and 
P,e-carotenoids while at the same time preventing formation of carotenoids with two epsilon 
rings. 

In view of the afore-mentioned deficiencies with prior art methods of producing 
carotenoids with two epsilon rings, it is clear that there exists a need in the art for such 
methods. 

SUMMARY OF THE INVENTION 
Accordingly, a first object of this invention is to provide isolated eukaryotic genes 
which encode enzymes which encode lycopene epsilon cyclases which form bicyclic epsilon- 
carotene. 

A second object of the present invention is to provide vectors containing said genes. 
A third object of the present invention is to provide hosts transformed with said 
vectors. 

A further object is to provide a method for producing a lycopene epsilon cyclase 
using the transformed host. 

A still further object is to provide the lycopene epsilon cyclase so produced. 

Another object of the present invention is to provide hosts which accumulates novel 
or rare carotenoids or which overexpress known carotenoids. 

Yet another object of the invention is to provide a method for producing novel or 
rare carotenoids. 

Another object of this invention is to secure the expression of eukaryotic carotenoid- 
related genes in a recombinant prokaryotic host. 

An additional object of the invention is a method of preparing a therapeutic 
composition comprising either the host cell which expresses the lycopene epsilon cyclase or 
the isolated carotenoids produced by the host cell containing the lycopene epsilon cyclase. 

Another object of the invention is to provide a method for the treatment of disease by 
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providing to a patient in need thereof, an amount of the rare carotenoids in an amount 
sufficient to treat the disease. 

These and other objects of the present invention have been realized by the present 
inventors as described below. 

BRIEF DESCRIPTION OF THE DRAWINGS 
A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following detailed description when considered in connection with the accompanying 
drawings, wherein: 

Figure 1 depicts possible routes of synthesis of cyclic carotenoids and some common 
plant and algal xanthophylls (oxycarotenolds) from lycopene. Activities of the €- cyclase 
enzyme of lettuce are indicated by bold arrows labelled with e. The reaction leading to e- 
carotene from 6-carotene is not catalyzed by the lycopene € cyclase of Arabidopsis 
(Cunningham 1996; U.S. Serial No. 08/624,125) or other known e-cyclases. Therefore, 
formation of e-cartene and carotenoids derived from it is now made possible with the lettuce 
lycopene e-cyclase describe herein. Arrows labelled with p indicate reactions synthesize by 
P-cyclase. 

Figure 2 depicts the carotene endgroups which are commonly found in plants. 

Figure 3 is a DNA sequence of the romaine lettuce cDNA (SEQ ID NO:l) encoding 
lycopene epsilon cyclase. 

Figure 4 is the predicted amino acid sequence of the romaine lettuce lycopene 
epsilon cyclase (SEQ ID NO:2). 

Figure 5 is a comparison between the predicted amino acid sequences of romaine 
lettuce (from clone DY4; SEQ ID NO:2) and Arabidopsis (from clone y2; SEQ ID NO:3) 
lycopene epsilon cyclase. 

Figure 6 shows the nucleotide and amino acid sequences of the e-cyclase #3 of 
Adonis palaestina, which also forms bicyclic epsilon carotene (SEQ ID NO: 4 and 5). 

Figure 7 Shows a sequence comparison of the Adonis palaestina €-cyclase #3 (SEQ 
ID NO: 5) compared to the Adonis palaestina e-cyclase #5 (SEQ ID NO: 6), the latter of 
which adds only a single epsilon ring to lycopene. Five amino acid differences are noted, 
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which may be targets for site-directed mutagenesis to form the lycopene €-cyclase which 
adds two e rings to lycopene. 

PET AI LED DES CRIPT ION 

Romaine lettuce is one of the rare plant species that produces an abundance of a 
carotenoid with two epsilon rings (lactucaxanihin). The present inventors have isolated a 
gene encoding the epsilon cyclase from this plant, and have found that it is similar in 
sequence to that of Arabidopsis (about 65% identity). However, the lettuce enzyme 
efficiently adds two epsilon rings to lycopene to form the bicyclic epsilon-carotene. 

The present invention also relates to methods for transforming known carotenoids 
into novel or rare products. That is, currently e-carotene (see Figure 1) and y-carotene can 
only be isolated in minor amounts. As described below, the enzymes of the invention can 
be produced and used to transform lycopene to bicyclic € -carotene. With such a product in 
hand, bulk biosynthesis of other carotenoids derived from the bicyclic epsilon carotene are 
possible. 

The eukaryotic genes in the carotenoid biosynthetic pathway differ from their 
prokaryotic counterparts in their 5* region. As used herein, the 5' region is the region of 
eukaryotic DNA which precedes the initiation codon of the counterpart gene in prokaryotic 
DNA. That is, when the consensus areas of eukaryotic and prokaryotic genes are aligned, 
the eukaryotic genes contain additional coding sequences upstream of the prokaryotic 
initiation codon. 

The invention also relates to genes encoding lycopene epsilon cyclase which are 
truncated at the 5' region of the gene. Preferably, such truncated genes are truncated to 
within 0-50, preferably 0-25, codons of the 5* initiation codon of their prokaryotic 
counterparts as determined by alignment maps. 

In addition to novel enzymes produced by truncating the 5' region of known 
enzymes, novel enzymes which can participate in the formation of novel carotenoids can be 
formed by replacing portions of one gene with an analogous sequence from a structurally 
related gene. The information for adding two epsilon rings can be found in the 3 1 half of 
the romaine lettuce gene. Thus, one example of such a hybrid gene construct would include 
the first half of the romaine lettuce cyclase gene in combination with the second (3*) half of 
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another plant cyclase gene, such as the potato gene or by random of site directed 
mutagenesis of a mono-€ cyclase. 



Vectors 

The genes encoding the carotenoid enzymes as described above, when cloned into a 
suitable expression vector, can be used to overexpress these enzymes in a plant expression 
system or to inhibit the expression of these enzymes. The production or the biochemical 
activity of the product of epsilon-cyclase genes and cDNAs may be reduced or inhibited by 
a number of different approaches available to those skilled in the art [including but not 
limited to such methodologies or approaches as anti-sense (e.g., Gray et al (1992)Plant Mol. 
Biol. 19:69-87), ribozymes (e.g., Wegener et al (1994) Mol. Gen. Genet. 245:465-470), co- 
suppression (e.g., Fray and Grierson(1993) Plant Mol. Biol. 22:589-602), targeted 
disruption of the gene (e.g., Schaefer et al. (1997) Plant J. 11:1195-1206), intracellular 
antibodies (e.g., Rondon and Marasco (1997) Ann. Rev. Microbiol. 51:257-283 or 
whatever other approaches rely on the knowledge or availability of the gene, cDNA, or 
polypeptide and/or the sequences of these] to thereby reduce accumulation of carotenoids 
with psilon rings and compounds derived from them. 

For example, a vector containing the gene encoding e-cyclase can be used to 
increase the amount of bicyclic epsilon-carotene in an organism and thereby alter the 
nutritional value, pharmacology and visual appearance value of the organism. In addition, 
the transformed organism can be used in the formulation of therapeutic agents, for example 
in the treatment of cancer (Mayne et al (1996) FASEB J. 10:690-701; Tsushima et al (1995) 
Biol. Pharm. Bull. 18:227-233, which are both incorporated herein by reference in their 
entireties). 

In a preferred embodiment, the vectors of the present invention contain a DNA 
encoding an eukaryotic IPP isomerase upstream of a DNA encoding a second eukaryotic 
carotenoid enzyme. The inventors have discovered that inclusion of an IPP isomerase gene 
increases the supply of substrate for the carotenoid pathway; thereby enhancing the 
production of carotenoid endproducts. This is apparent from the much deeper pigmentation 
in carotenoid-accumulating colonies of E. coli which also contain one of the aforementioned 
IPP isomerase genes when compared to colonies that lack this additional IPP isomerase 
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gene. Similarly, a vector comprising an IPP isomerase gene can be used to enhance 
production of secondary metabolites of dimethylallyl pyrophosphate (such as isoprenoids, 
steroids, carotenoids, etc.). 

Alternatively, an anti-sense strand of one of the above genes can be inserted into a 
vector. For example, the e-cyclase gene can be inserted into a vector and incorporated into 
the genomic DNA of a host, thereby inhibiting the synthesis of e,(J carotenoids (lutein and 
a-carotene) and enhancing the synthesis of bicyclic epsilon carotenoids. 

Suitable vectors according to the present invention comprise a eukaryotic gene 
encoding an enzyme involved in carotenoid biosynthesis or metabolism and a suitable 
promoter for the host can be constructed using techniques well known in the art (for 
example Sambrook et al.. Molecular Cloning A Laboratory Manual. Cold Spring Harbor 
Laboratory, Cold Spring Harbor, NY, 1989). 

Suitable vectors for eukaryotic expression in plants are described in Frey et al., Plant 
J. (1995) 8(5):693 and Misawa et al, 1994; incorporated herein by reference in their 
entireties. 

Suitable vectors for prokaryotic expression include pACYC184, pUC119, and 
pBR322 (available from New England BioLabs, Bevery, MA), pTrcHis (Invitrogen), 
Bluescript SK (Stratagene) and pET28 (Novagen) and derivatives thereof. 

The vectors of the present invention can additionally contain regulatory elements 
such as promoters, repressors selectable markers such as antibiotic resistance genes, etc. 

Hosts 

Host systems according to the present invention can comprise any organism that 
already produces carotenoids or which has been genetically modified to produce 
carotenoids. 

Organisms which already produce carotenoids include plants, algae, some yeasts, 
fungi and cyanobacteria and other photosynthetic bacteria. Transformation of these hosts 
with vectors according to the present invention can be done using standard techniques such 
as those described in Misawa et al., (1990) supra; Hundle et al., (1993) supra; Hundle et 
al., (1991) supra; Misawa et al., (1991) supra; Sandmann et al., supra; and Schnurr et al., 
supra; all incorporated herein by reference in their entireties. 
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E. coli is an example of one type of bacteria which is suitable as a host for 
expression of the present enzymes (Cunningham et al, (1996) The Plant Cell 8:1613-1626, 
which is incorporated herein by reference in its entirety). A vector is used to construct 
plasmids containing genes encoding the enzymes of the invention, which vector allows it to 
coexist in E. coli with cloning vectors that contain the more common ColEl origin of 
replication. The addition of epsilon cyclic end groups to the pink-colored lycopene will 
result in products that are yellow or orange-yellow in color. Therefore, the functioning of 
the epsilon lycopene cyclase of the invention may be detected by a change in the color of E. 
coli cultures that accumulate lycopene. Such assays are termed color complementation 
assays. 

Alternatively, transgenic organisms can be constructed which include the DNA 
sequences of the present invention (Bird et al, 1991; Bramley et al, 1992; Misawa et al, 
1994a; Misawa et al, 1994b; Cunningham et al, 1993, all of which are incorporated by 
reference herein in their entireties). The incorporation of these sequences can allow the 
controlling of carotenoid biosynthesis, content, or composition in the host cell. These 
transgenic systems can be constructed to incorporate sequences which allow over-expression 
of the carotenoid genes of the present invention. Transgenic systems can also be 
constructed containing antisense expression of the DNA sequences of the present invention. 
Such antisense expression would result in the accumulation of the substrates of the enzyme 
encoded by the sense strand. 

Appropriate transgenic hosts include lettuce, the natural host, but also other plants 
such as marigold, tomato, pepper, banana, potato and the like. Essentially any plant is 
suitable for expressing the present enzyme, but the preferred plants are those which already 
produce high levels of carotenoids, and those which are normally ingested as foods or used 
as a source of carotenoid pigments. In particular, plants which bear fruit can be 
manipulated in such a way as to provide tissue-specific expression in fruit. Marigold is a 
particularly preferred host, because it can be used as a "bioreactor" for bulk production of 
carotenoids, and is actually grown commercially as a carotenoid source for chicken feed. 
For expression in marigold, a promoter can be used which is "flower-specific." Another 
preferred transgenic plant is tomato, because this plant already produces high levels of 
lycopene. Indeed, it has been reported that there is a correlation between consuming 
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tomatoes and decreased incidence of colon cancer (mayne, supra). 
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A method f or screening for eukarvotic genes which encode enzvmes involved in carotenoid 

biosynthesis 

The method of the present invention comprises transforming a prokaryotic host with 
a DNA which may contain a eukaryotic or prokaryotic carotenoid biosynthetic gene; 
culturing said transformed host to obtain colonies; and screening for colonies exhibiting a 
different color than colonies of the untransformed host. 

Suitable hosts include E. coli, cyanobacteria such as Synechococcus and 
Synechocystis, alga and plant cells. E. coli are preferred. 

In a preferred embodiment, the above "color complementation test" can be enhanced 
by using mutants which are either (1) deficient in at least one carotenoid biosynthetic gene 
or (2) overexpress at least one carotenoid biosynthetic gene. In either case, such mutants 
will accumulate carotenoid precursors. 

Prokaryotic and eukaryotic genomic and cDNA libraries can be screened in total for 
the presence of genes of carotenoid biosynthesis, metabolism and degradation. Preferred 
organisms to be screened include photosynthetic organisms, humans and animals. 

E. coli can be transformed with these eukaryotic cDNA libraries using conventional 
methods such as those described in Sambrook et al, 1989 and according to protocols 
described by the venders of the cloning vectors. 

For example, the cDNA libraries in bacteriophage vectors such as lambdaZAP 
(Stratagene) or lambdaZIPLOX (Gibco BRL) can be excised en masse and used to transform 
E. coli. Suitable vectors include pAGYC184, pUC119, pBR322 (available from New 
England BioLabs, Bevery, MA). pACYC is preferred. 

Transformed coli can be cultured using conventional techniques. The culture 
broth preferably contains antibiotics to select and maintain plasmids. Suitable antibiotics 
include penicillin, ampicillin, chloramphenicol, etc. Culturing is typically conducted at 15- 
45°C, preferably at room temperature (16-28°C), for 12 hours to 7 days. 

Cultures are plated and the plates are screened visually for colonies with a different 
color than the colonies of the host E. coli transformed with the empty vector. For example, 
E. coli transformed with the plasmid, pAC-BETA (described below), produce yellow 
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colonies that accumulate P-carotene. After transformation with a cDNA library, colonies 
which contain a different hue than those formed by E. co/i/pAC-BETA would be expected 
to contain enzymes which modify the structure or degree of expression of P-carotene. 
Similar standards can be engineered which overexpress earlier products in carotenoid 
biosynthesis, such as lycopene, y- carotene * etc - 

Having generally described this invention, a further understanding can be obtained 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 

EXAMPLE 

Isolation of lvcopene epsilon cyclase 

The lycopene epsilon cyclase was isolated from a romaine lettuce library obtained 
from Dr. Harry Y. Yamamoto (University of Hawaii, Honolulu) essentially as disclosed in 
Cunningham et al, 1996, supra, and Bugos and Yamamoto (1996) Proc. Natl. Acad. Sci. 
USA 93:6320-6325, both of which are incorporated herein by reference in their entireties. 
Functional clones were identified by the color complementation test. 

Pigment Analysis 

A single colony was used to inoculate 50 ml of LB containing ampicillin and 
chloramphenicol in a 250-ml flask. Cultures were incubated at 28° C for 36 hours with 
gentle shaking, and then harvested at 5000 rpm in an SS-34 rotor. The cells were washed 
once with distilled H 2 0 and resuspended with 0.5 ml of water. The extraction procedures 
and HPLC were essentially as described previously (Cunningham et al, 1994). 

Organisms and Growth Conditions 

E. coli strains TOP10 and TOP10 F (obtained from Invitrogen Corporation, San 
Diego, CA) and XLl-Blue (Stratagene) were grown in Luria-Bertani (LB) medium 
(Sambrook et al., 1989) at 37°C in darkness on a platform shaker at 225 cycles per min. 
Media components were from Difco (yeast extract and tryptone) or Sigma (NaCl). 
Ampicillin at 150 pig/mL and/or chloramphenicol at 50 /xg/mL (both from United States 
Biochemical Corporation) were used, as appropriate, for selection and maintenance of 
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plasmids. 

Mass Excis ion and Col or Complementation Screening of Romaine Lettuce cDNA 
Library 

A cDNA library of romaine lettuce in lambda ZAPII (Bugos & Yamamoto) was 
obtained from Henry Yamamoto, as noted above. An aliquot of each library was treated to 
cause a mass excision of the cDNAs and thereby produce a phagemid library according to 
the instructions provided by the supplier of the cloning vector (Stratagene; E. coli strain 
XLl-Blue and the helper phage R408 were used). The titre of the excised phagemid was 
determined and the library was introduced into a lycopene-accumulating strain of E. coli 
TOP10 F by incubation of the phagemid with the E, coli cells for 15 min at 37°C. Cells 
had been grown overnight at 30°C in LB medium supplemented with 2% (w/v) maltose and 
10 mM MgS0 4 (final concentration), and harvested in 1.5 ml microfiige tubes at a setting of 
3 on an Eppendorf microfiige (5415C) for 10 min. The pellets were resuspended in 10 mM 
MgS0 4 to a volume equal to one-half that of the initial culture volume. Transformants were 
spread on large (150 mm diameter) LB agar petri plates containing antibiotics to provide for 
selection of cDNA clones (ampicillin) and maintenance of pAC-LYC (chloramphenicol). 
Approximately 10,000 colony forming units were spread on each plate. Petri plates were 
incubated at room temperature for 2 to 7 days to allow maximum color development. Plates 
were screened visually with the aid of an illuminated 3x magnifier and a low power stage- 
dissecting microscope for the rare, pale pinkish-yellow to deep-yellow colonies that could be 
observed in the background of pink colonies. A colony color of yellow or pinkish-yellow 
was taken as presumptive evidence of a cyclization activity. These yellow colonies were 
collected with sterile toothpicks and used to inoculate 3ml of LB medium in culture tubes 
with overnight growth at 37 °C and shaking at 225 cycles/min. Cultures were split into two 
aliquots in microfiige tubes and harvested by centrifiigation at a setting of 5 in an Eppendorf 
541 5C microfiige. After discarding the liquid, one pellet was frozen for later purification of 
plasmid DNA. To the second pellet was added 1.5 ml EtOH, and the pellet was 
resuspended by vortex mixing, and extraction was allowed to proceed in the dark for 15-30 
min with occasional remixing. Insoluble materials were pelleted by centrifiigation at 
maximum speed for 10 min in a microfiige. Absorption spectra of the supernatant fluids 
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were recorded from 350-550 nm with a Perkin Elmer lambda six spectrophotometer. 



Analysis of isolated clones 

Eight of the yellow colonies contained e-carotene indicating that a single gene 
product catalyzes both cyclizations required to form the two e endgroups of the symmetrical 
e-carotene from the symmetrical precursor lycopene. 

The availability of the romaine lettuce gene encoding the € cyclase enables the 
directed manipulation of plant and algal species for modification of carotenoid content and 
composition. Through inactivation of the e cyclase, whether at the gene level by deletion of 
the gene or by insertional inactivation or by reduction of the amount of enzyme formed (by 
such as antisense technology), one may increase the formation of P-carotene and other 
pigments derived from it. Since vitamin A is derived only from carotenoids with p 
endgroups, an enhancement of the production of p-carotene versus a-carotene may enhance 
nutritional value of crop plants. Reduction of carotenoids with e endgroups may also be of 
value in modifying the color properties of crop plants and specific tissues of these plants. 
Alternatively, where production of a-carotene, or pigments such as lutein that are derived 
from a-carotene, is desirable, whether for the color properties, nutritional value or other 
reason, one may overexpress the e cyclase or express it in specific tissues. Wherever 
agronomic value of a crop is related to pigmentation provided by carotenoid pigments the 
directed manipulation of expression of the e cyclase gene and/or production of the enzyme 
may be of commercial value. 

The predicted amino acid sequence of the romaine lettuce e cyclase enzyme (SEQ ID 
NO:2) was determined. A comparison of the amino acid sequences of the e cyclase 
enzymes of Arabidopsis thaliana and romaine lettuce (Figure 5) as predicted by the DNA 
sequence of the respective genes (Fig. 3 for the e cyclase cDNA sequence), indicates that 
these two enzymes have many regions of sequence similarity, but they are only about 65% 
identical overall at the amino acid level. 
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Having now fully described the invention, it will be apparent to one of ordinary skill 
in the art that many changes and modifications can be made thereto without departing from 
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Claims; 

1 . An isolated eukaryotic enzyme which converts lycopene to epsilon, epsilon- 
carotene. 

2. An isolated eukaryotic enzyme of Claim 1 having the amino acid sequence of 
SEQ ID NO: 2. 

3. An isolated DNA sequence comprising a gene encoding the eukaryotic e cyclase 
of Claim 2. 

4. The isolated DNA sequence according to Claim 3, having the nucleic acid 
sequence of SEQ ID NO: 1 . 

5. An expression vector comprising the DNA sequence of Claim 3. 

6. A host containing the expression vector of Claim 5. 

7. The host of Claim 6, wherein said host is E. coli. 

8. The host of Claim 6, wherein said host is a plant. 

9. The host of Claim 8, wherein said host is marigold. 

10. The host of Claim 8, wherein said host is tomato. 

11. A composition comprising the host of Claim 6. 

12. A composition comprising the host of Claim 8. 

13. A composition comprising bicyclic epsilon carotene obtained from the host of 
Claim 6. 
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14. A composition comprising tricyclic epsilon carotene obtained from the host of 
Claim 8. 

15. A method for treating disease comprising administering to a patient in need 
thereof, an amount of the composition of Claim 13 sufficient to treat said disease. 

16. A method for treating disease comprising administering to a patient in need 
thereof, an amount of the composition of Claim 14 sufficient to treat said disease. 
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\|f endgroup 

h r" 





0 endgroup 



€ endgroup 



1 MECF6ARNMT ATMAVFTCPR FTDCNIRHKF SLLKQRRFTN LSASSSLRQI 

51 KCSAKSDRCV VDKQGISVAD EEDYVKAGGS ELFFVQMQRT KSMESQSKLS 

101 EKLAQIPIGN CILDLVVIGC GPAGLALAAE SAKLGLNVGL IGPDLPFTNN 

151 YGVWQDEFIG LGLEGCIEHS WKDTLVYLDD ADPIRIGRAY GRVHRDLLHE 

201 ELLRRCVESG VSYLSSKVER ITEAPNGYSL IECEGNITIP CRLATVASGA 

251 ASGKFLEYEL GGPRVCVQTA YGIEVEVENN PYDPDLMVFM DYRDFSKHKP 

301 ESLEAKYPTF LYVMAMSPTK IFFEETCLAS REAMPFNLLK SKLMSRLKAM 

351 GIRITRTYEE EVSYIPVGGS LPNTEQKNLA FGAAASMVHP ATGYSVVRSL 

401 SEAPNYAAVI AKILRQDQSK EMISLGKYTN I SKQAWETLW PLERKRQRAF 

451 FLFGLSHIVL XDLEGTRTFF RTFFRLPKWM WWGFLGSSLS STDLI IFALY 

501 MFVIAPHSLR MELVRHLLSD PTGATMVKAY LTIx 
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1 gaaacaaatg acgtgaaagt -tcttcaaaat tgaattaatt gtaatcctga 

51 aaacttgatt tgtgatagaa gaatcaatgg agtgctttgg agctcgaaac 

101 ortgacggcaa caatggcggt tt-ttacg-fcgc cctagattca cggactgtaa 

151 tatcaggcac aaattt-tcg-t tactgaaaca acgaagattt actaatttat 

201 cagcatcgtc ttcgttgcgt caaattaagt gcagcgctaa aagcgaccgt 

251 tgtgtagtgg ataaacaagg gatitccgta gcagacgaag aagattatgt 

301 gaaggccggt ggatcggagc tgttttttgt tcaaatgcag cggactaagi; 

351 ccatggaaag ccagtctaaa ct-ttccgaaa agctagcaca gataccaatt 

401 ggaaa-k-tgca tacttgatct ggttgtaatc ggttgtggcc ctgctggcct 

451 tgctcttgct gcagogtcag ccaaactagg gttgaacgtt ggactcattg 

501 gccctgatct iccttt"taca aacaattotg gtgtttggca ggatgaattt 

551 ataggtcttg gacttgaagg atgcattgaa cattctigga aagntactct 

601 tgtatacctt gatgatgctg ataatccg cataggtcgt gcatatggco 

651 gagttcatcg tgatttactt catgaagagt tgttaagaag g-tgtgtggaa 

710 tcagg-tgttt catatctaag ctccaaag-ta gaaagaatca ctgaagctcc 

751 aaatggctat agtctcattg aortgtgaagg caatatcacc <rttccatgca 

801 ggcttgctac -tgttgcatca ggggcagctt cagggaaati; tctggagtai; 

851 gaacttgggg gtccccgtgt ttgtg-tccaa acagcttatg gtatagaggt 

901 tgaggttgaa aacaacccct atgatccaga tctaQtggtg ttcatggatt 

951 atagagactt ctcaaaacat aaaccggaat ctttagaagc aaaatatccg 

1001 actttcctct atgtcatggc catgtctcca acaaaaatat tcttcgagga 

1051 aacttgttta gcttcaagag aagcca-tgcc tttcaatcti; ctaaagtcca 

1101 aactcatgtc acgattaaag gcaatgggta tccgaataac aagaacg-tac 

1151 gaagaggaat ggtcgtatat ccccgtaggt ggatcg-ttac ctaatacaga 

1201 acaaaagaat ctcgcatttg gtgctgcagc tag-tatggtg caccctgcca 

1251 cagggtattc agttg-ttcga tcttigtcag aagctcctaa ttatgcagca 

1301 gtcattgcta aga-ttttaag acaagatcaa tc-taaagaga tgatttctct 

1351 tggaaaatac actaacattt caaaacaagc atgggaaaca ti;gtggccac 

1401 ttgaaaggaa aagacagcga gccttc-tttc -tattcggact atcacacatc 

1451 g-tgctaatng atctagaggg aacacgtaca tttitccgta ctttctt-tcg 

1501 t-ttgcccaaa tggatgtggt ggggait-ttt ggggtcttct ttatcttcaa 

1551 cgga-tttgat aatatttgcg ctttatatg-t ttgigatagc acctcacagc 

1601 ttgagaatgg aactggttag acatctactt tctgatccga caggggcaac 

1651 tatggtaaaa gcatatctca ctatatagat "ttagat-fcata taaa-taatac 

1701 ccatatcttg catatatata agcctta-ttt ait-tcttttg tacccccaca 

1751 acaaca-tact cgttaattat atgittttta 
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Adonis palaestina e-cyclase cDNA 13 Length 1848 

1 gagagaaaaa gagtgttata ttaatgttac tgtcgcattc ttgcaacaca 
51 tattcagact ccattttctt gttttctctt caaaacaaca aactaatgtg 
101 acggagtatc tagctatgga Qciacttggt gttcgcaacc tcatctcttc 
151 ttgccctgtc tggacttttg gaacaagaaa ccttagtagt tcaaaactag 
201 cttataacat acatcgatcrt ggttcttctt gtagagtaga ttttcaagtg 
251 agggctgatg g-kggaagcgg gugtagaact tctgttgctt ataaagaggg 
301 ttttgtggac gaggaggatt ttatcaaagc tggtggttct gagcttttgt 
351 ttgtccaaat gcagcaaaca aagtctatgg agaaacaggc caagctcgcc 
401 gataagttgc caccaatacc tttcggagaa tctgtgatgg acttggttgt 
451 aataggttgt ggacctgctg gtrtttcact ggctgcagaa gctgctaagc 
501 taggcttgaa agttggcctt ottggtcctg atcttccttt tacaaataat 
551 tatggtgtgt gggaagacga gttcaaagat cttggacttg aacgttgtat 
601 cgagcatgct tggaaggaca ccatcgtata tcttgacaat gatgctcctg 
651 tccttattgg tcgtgcatai; ggacgagtta gccggcattt gctgcatgaa 
701 gagttgctga aaaggtgtgt cgagtcaggt gtatcatatc tgaattctaa 
751 agtggaaQgg atcartgaag ctgg-tga-tgg ccatagtctt gtagtttgtg 
801 aaaacgacat ctttatccct tgcaggcttg ctactgttgc atctggagca 
851 gcttcoggga aacttttgga g-tatgaagta ggtggccctc gtgtttgtgt 
901 ccaaactgct tatggtgtgg aggttgaggt ggagaacaat ccatacgatc 
951 ccaacttaat ggtatttatg gactacagag actatatgca acagaaatta 
1001 cagtgctcgg aagaagaata tccaacattt ctctatgtcQ tgcccatgtc 
1051 gccaacaaga cttttttttg aggaaacctg tttggcctca aaagatgcca 
1101 tgcctttcga tctactgaag agaaaactaa tgtcacgatt gaagactctg 
1151 ggtatccaag ttacaaaaat ttatgaagag gaa'tggtctt a-tattcctgt 
1201 tgggggttct ttaccaaaca cagagcaaaa gaacctagca tttggtgctg 
1251 cagcaagcat gg-tgcatcca gcaacaggct attcggttgt acgatcacta 
1301 tcagaagctc caaaatatgc ttctgtaatt gcaaagattt tgaagcaaga 
1351 taactc-tgca tatgtggttt ctggacaaag cagtgcagta aacatttcaa 
1401 tgcaagcatg gagcagtctt tggccaaagg agcgaaaacg tcaaagagca 
1451 ttctttcttt tcgggttaga gcttattgtg cagctagata ttgaagcaac 
1501 cagaacg-ttc tttagaacct tcttccgctt gccaacttgg atgtggtggg 
1551 g-tttcctigg gtcttcacta tcatctttcg atcttgtatt gttttccatg 
1601 tacatgtttg ttttggcccc gaacagcatg aggatgtcac ttgtgagaca 
1651 tttgct-ttca gatccttctg gtgcagttat ggi-taaagci tacctcgaaa 
1701 ggtaatctgt tttatgaaac tatagtgtct cattaaataa atgaggaicc 
1751 ttcgtatatg tatatgatca ictctatgta tatcctatat tctaatctca 
1801 taaag-taa-tc gaaaattcal; tgatagaaaa aaaaaaaaaa aaaaaaaa 
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Adonis palaestina e-cyclase #3 predicted polypeptide 
TRANSLATE from 116 to: 1705 Length i 529 amino acids 

1 MELLGVRNLI SSCPVWTFGT RNLSSSKLAY NIHRYGSSCR VDFQVRADGG 

51 SGSRTSVAYK EGFVDEEDF1 KAGGSELLFV QMQQTKSMEK QAKLADKLPP 

101 IPFGESVMDL VVIGCGPAGL SLAAEAAKLG LKVGLIGPDL PFTNNYGVWE 

151 DEFKDLGLER CIEHAWKDTI VYLDNDAPVL IGRAYGRVSR HLLHEELLKR 

201 CVESGVSYLN SKVERITEAG DGHSLVVCEN DIFIPCRLAT VASGAASGKL 

251 LEYEVGGPRV CVQTAYGVEV EVENNPYDPN LMVFMDYRDY MQQKLQCSEE 

301 EYPTFLYVMP MSPTRLFFEE TCLASKBAMP FDLLKRKLMS RLKTLGIQVT 

351 KIYEEEVSYI PVGGSLPNTE QKNLAFGAAA SMVHPATGYS VVRSLSEAPK 

401 YASVIAKILK QDNSAYVVSG QSSAVNISMQ AWSSLWPKER KRQRAFFLFG 

451 LELIVQLDIE ATRTFFRTFF RLPTWMWWGF LGSSLSSFDL VLFSMYMFVL 

501 APNSMRMSLV RHLLSDPSGA VMVKAYLER* 
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GAP program of Genetics Computer Group 
blosun 62, cnp 

Gap Weight 12 Average Match' 2.912 

Length Weight 4 Average Mismatch: -2,003 

Quality: 2728 Length: 530 

Ratio: 5. 147 Gaps: 0 

Percent Similarity: 99.623 Percent Identity: 99.057 

Match display thresholds for the alignment (s): I = IDENTITY « =2 .=1 

Adonis palaestina £-cyclase #3 x Adonis palaestina e-cyclase S5 



1 MELLGVRNLISSCPVWTFGTRNLSSSKLAYNIHRYGSSCRVDFQVRADGG 50 

i mellgvrnlIsscpvwIf^ 50 

51 SGSRTSVAYKEGFVDEEDFIKAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

51 SGSRSS V AYKEGF VDEEDF I KAGGSELLFVQMQQTKSMEKQAKLADKLPP 100 

101 I PFGESVMDLVV I GCGPAGLSL AAEAAKLGLKVGL I GPDLPFTNN YGVWE 150 

101 I PFGESVMDLVV I GCGPA^ 150 

151 DEFKDLGLERC I EHAWKDT I V YLDNDAPVL I GRAYGRVSRHLLHEELLKR 200 

151 DEFKDLGLERC I EHAWKDT I V YLDNDAPVL I GRAYGRVSRHLLHEELLKR 200 

201 CVESGVSYLNSKVER I TEAGDGHSLVVCENDIF I PCRLATVASGAASGKL 250 

201 CVESGVSYLDSKVERITEAGDGHSLVVCENEIFIPCRLATVASGAASGKL 250 

« • 1 

251 LEYEVGGPRVCVQTAYGVEVEVENNPYDPNLMVFMDYRDYMQQKLQCSEE 300 
251 LEYEVGGPRVCVQW 300 

FIG. 
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301 EYPTFLYVMPMSPTRLFFEETCLASKDAMPFDLLKRKLMSRLKTLGIQVT 350 

301 E YPTFL YVMPMSPTRLFFEETCLASKL AMPFDLLKRKLMSRLKTLG I QVT 350 

351 K I YEEEWS Y I PVGGSLPNTEQKNL AFG AAASM VHPATGYSVVRSLSEAPK 400 

351 KVYEEEWSY I PVGGSLPNTEQKNL AFG AAASMVHPATGYSVVRSLSEAPK 400 

401 Y ASV I AK I LKQDNS AYVVSGQSSAVN I SMQAWSSLWPKERKRQRAFFLFG 450 

401 Y AS V I AK I LKQDNS AY VVSGQSSAVN I SMQAWSSLWPKERKRQRAFFLFG 450 

451 LEL I VQLD I EATRTFFRTFFRLPTWMWWGFLGSSLSSFdLvLFSMYMFvL 500 

451 LEL I VQLD I E ATRTFFRTFFRLPTUMWWGFLGSSLSSFDLVLFSMYMFVL 500 

501 APNSMRMSLVRHLLSDPSGAVMVKAYLERx 530 

501 APNSMRMSLVRHLLSDPSGAVMVRAYLER* 530 



SUBSTITUTE SHEET (RULE 26) 



WO 99/61399 



SEQUENCE LISTING 



PCT/US99/10461 



<110> CUNNINGHAM JR., FRANCIS X. 
SUN, ZAIREN 



<12 0> GENES ENCODING EPSILON LYCOPENE CYCLASE AND METHOD FOR 
PRODUCING BICYCLIC EPSILON CAROTENE 

<130> 2747-0084-27 CIP 



<140> 09/084,222 
<141> 1998-05-26 

<150> 08/937,155 
<151> 1997-09-25 

<150> 08/624,125 
<151> 1996-03-29 

<160> 6 

<170> Patentln Ver. 2.0 



<210> 1 
<211> 1780 
<212> DNA 

<213> romaine lettuce 

<223> n is an unspecified nucleotide 



<400> 1 

gaaacaaatg acgtgaaagt tcttcaaaat 
tgtgatagaa gaatcaatgg agtgctttgg 
ttttacgtgc cctagattca cggactgtaa 
acgaagattt actaatttat cagcatcgtc 
aagcgaccgt tgtgtagtgg ataaacaagg 
gaaggccggt ggatcggagc tgttttttgt 
ccagtctaaa ctttccgaaa agctagcaca 
ggttgtaatc ggttgtggcc ctgctggcct 
gttgaacgtt ggactcattg gccctgatct 
ggatgaattt ataggtcttg gacttgaagg 
tgtatacctt gatgatgctg atcccatccg 
tgatttactt catgaagagt tgttaagaag 
ctccaaagta gaaagaatca ctgaagctcc 
caatatcacc attccatgca ggcttgctac 
tctggagtat gaacttgggg gtccccgtgt 
tgaggttgaa aacaacccct atgatccaga 
ctcaaaacat aaaccggaat ctttagaagc 
catgtctcca acaaaaatat tcttcgagga 
tttcaatctt ctaaagtcca aactcatgtc 
aagaacgtac gaagaggaat ggtcgtatat 
acaaaagaat ctcgcatttg gtgctgcagc 
agttgttcga tctttgtcag aagctcctaa 
acaagatcaa tctaaagaga tgatttctct 



tgaattaatt gtaatcctga aaacttgatt 6 0 
agctcgaaac atgacggcaa caatggcggt 120 
tatcaggcac aaattttcgt tactgaaaca 180 
ttcgttgcgt caaattaagt gcagcgctaa 240 
gatttccgta gcagacgaag aagattatgt 3 00 
tcaaatgcag cggactaagt ccatggaaag 3 60 
gataccaatt ggaaattgca tacttgatct 420 
tgctcttgct gcagagtcag ccaaactagg 480 
tccttttaca aacaattatg gtgtttggca 540 
atgcattgaa cattcttgga aagatactct 600 
cataggtcgt gcatatggca gagttcatcg 660 
gtgtgtggaa tcaggtgttt catatctaag 72 0 
aaatggctat agtctcattg aatgtgaagg 780 
tgttgcatca ggggcagctt cagggaaatt 84 0 
ttgtgtccaa acagcttatg gtatagaggt 900 
tctaatggtg ttcatggatt atagagactt 96 0 
aaaatatccg actttcctct atgtcatggc 1020 
aacttgttta gcttcaagag aagccatgcc 1080 
acgattaaag gcaatgggta tccgaataac 1140 
ccccgtaggt ggatcgttac ctaatacaga 12 0 0 
tagtatggtg caccctgcca cagggtattc 1260 
ttatgcagca gtcattgcta agattttaag 1320 
tggaaaatac actaacattt caaaacaagc 13 8 0 
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atgggaaaca ttgtggccac ttgaaaggaa aagacagcga gccttccttc tattcggact 1440 

atcacacatc gtgctaatng atctagaggg aacacgtaca tttttccgta ctttctttcg 1500 

tttgcccaaa tggatgtggt ggggattttt ggggtcttct ttatcttcaa cggatttgat 1560 

aatatttgcg ctttatatgt ttgtgatagc acctcacagc ttgagaatgg aactggttag 1620 

acatctactt tctgatccga caggggcaac tatggtaaaa gcatatctca ctatatagat 1680 

ttagattata taaataatac ccatatcttg catatatata agccttattt atttcttttg 1740 

tacccccaca acaacatact cgttaattat atgtttttta 1780 

<210> 2 

<211> 533 

<212> PRT 

<213> roniairie lettuce 

<400> 2 

Met Glu Cys Phe Gly Ala Arg Asn Met Thr Ala Thr Met Ala Val Phe 
1 ' 5 10 .15 

Thr Cys Pro Arg Phe Thr Asp Cys Asn lie Arg His Lys Phe Ser Leu 
20 25 30 

Leu Lys Gin Arg Phe Thr Asn Leu Ser Ala Ser Ser Ser Leu Arg 
3 5 40 4 5 

Gin He Lys Cys Ser Ala Lys Ser Asp Arg Cys Val Val Asp Lys Gin 
50 55 6 0 

Gly He Ser Val Ala Asp Glu Glu Asp Tyr Val Lys Ala Gly Gly Ser 
65 70 75 80 

Glu Leu Phe Phe Val Gin Met Gin Arg Thr Lys Ser Met Glu Ser Gin 

85 90 95 

Ser Lys Leu Ser Glu Lys Leu Ala Gin He Pro He Gly Asn Cys He 
100 105 HO 

Leu Asp Leu Val Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala 
115 120 125 

Ala Glu Ser Ala Lys Leu Gly Leu Asn Val Gly Leu He Gly Pro Asp 
13 0 .135 14 0 

Leu Pro Phe Thr Asn Asn Tyr Gly Val Trp Gin Asp Glu Phe He Gly 
145 150 155 160 

Leu Gly Leu Glu Gly Cys He Glu His Ser Trp Lys Asp Thr Leu Val 
165 170 175 

Tyr Leu Asp Asp Ala Asp Pro He Arg He Gly Arg Ala Tyr Gly Arg 
180 185 190 

Val His Arg Asp Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu 
195 200 205 
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ser Gly Val Ser Tyr Leu Ser Ser Lys Val Glu Arg lie Thr Glu Ala 
210 215 220 

Pro Asn Gly Tyr Ser Leu He Glu Cys Glu Gly Asn He Thr lie Pro 



225 



230 235 240 



Cys Arg Leu Ala Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Phe Leu 
245 250 250 

Glu Tyr Glu Leu Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly 
260 265 270 

lie Glu Val Glu Val Glu Asn Asn Pro Tyr Asp Pro Asp Leu Met Val 



275 



280 



285 



Phe Met Asp Tyr Arg Asp Phe Ser Lys His Lys Pro Glu Ser Leu Glu 
290 295 300 

Ala Lys Tyr Pro Thr Phe Leu Tyr Val Met Ala Met Ser Pro Thr Lys 



305 



310 315 320 



lie Phe Phe Glu Glu Thr Cys Leu Ala Ser Arg Glu Ala Met Pro Phe 



325 



330 



Asn Leu Leu Lys Ser Lys Leu Met Ser Arg Leu Lys Ala Met Gly He 



340 



345 350 



Arg 



lie Thr Arg Thr Thr Glu Glu Glu Trp Ser Tyr He Pro Val Gly 



355 



360 



365 



Gly Ser Leu Pro Asn Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala 
370 375 380 

Ala Ser Met Val His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu 
385 390 395 400 

Glu Ala Pro Asn Tyr Ala Ala Val He Ala Lys He Leu Arg Gin 



Ser 

405 



410 



Asp Gin Ser Lys Glu Met lie Ser Leu Gly Lys Tyr Thr Asn He Ser 
420 «5 430 

Lys Gin Ala Trp Glu Thr Leu Trp Pro Leu Glu Arg Lys Arg Gin Arg 



435 



440 445 



Ala Phe Phe Leu Phe Gly Leu Ser His He Val Leu Met Asp Leu Glu 
450 455 460 

Gly Thr Arg Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Lys Trp Met 
465 470 475 480 

Trp Trp Gly Phe Leu Gly Ser Ser Leu Ser Ser Thr Asp Leu He He 
485 490 495 
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Phe Ala Leu Tyr Met Phe Val He Ala Pro His Ser Leu Arg Met Glu 
500 505 510 

Leu Val Arg His Leu Leu Ser Asp Pro Thr Gly Ala Thr Met Val Lys 
515 520 525 

Ala Tyr Leu Thr He 

530 
<210> 3 
<211> 524 
<212> PRT 
<213> Arabidopsis 

<400> 3 

Met Glu Cys Val Gly Ala Arg Asn Phe Ala Ala Met Ala Val Ser Thr 
1 * 5 10 15 

Phe Pro Ser Trp Ser Cys Arg Arg Lys Phe Pro Val Val Lys Arg Tyr 
20 25 30 

Ser Tyr Arg Asn lie Arg Phe Gly Leu Cys Ser Val Arg Ala Ser Gly 
35 40 45 

Gly Gly Ser Ser Gly Ser Glu Ser Cys Val Ala Val Arg Glu Asp Phe 
50 55 60 

Ala Asp Glu Glu Asp Phe Val Lys Ala Gly Gly Ser Glu He Leu Phe 
65 70 75 80 

Val Gin Met Gin Gin Asn Lys Asp Met Asp Glu Gin Ser Lys Leu Val 

85 90 95 

Asp Lys Leu Pro Pro He Ser He Gly Asp Gly Ala Leu Asp His Val 
100 105 HO 

Val He Gly Cys Gly Pro Ala Gly Leu Ala Leu Ala Ala Glu Ser Ala 
115 120 125 

Lys Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr 
130 135 140 

Asn Asn Tyr Gly Val Trp Glu Asp Glu Phe Asn Asp Leu Gly Leu Gin 
145 150 155 160 

Lys Cys He Glu His Val Trp Arg Glu Thr He Val Tyr Leu Asp Asp 
165 170 175 

Asp Lys Pro He Thr He Gly Arg Ala Tyr Gly Arg Val Ser Arg Arg 
180 185 190 

Leu Leu His Glu Glu Leu Leu Arg Arg Cys Val Glu Ser Gly Val Ser 
195 200 205 
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Tyr Leu Ser Ser Lys Val Asp Ser lie Thr Glu Ala Ser Asp Gly Leu 
210 215 220 

Arg Leu Val Ala Cys Asp Asp Asn Asn Val He Pro Cys Arg Leu Ala 



225 



230 235 240 



Thr Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Gin Tyr Glu Val 
245 250 255 

Gly Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu 
260 265 270 

Val Glu Asn Ser Pro Tyr Asp Pro Asp Gin Met Val Phe Met Asp Tyr 
275 280 285 

Arg Asp Tyr Thr Asn Glu Lys Val Arg Ser Leu Glu Ala Glu Tyr Pro 
290 295 300 

Thr Phe Leu Tyr Ala Met Pro Met Thr Lys Ser Arg Leu Phe Phe Glu 
305 310 315 320 

Glu Thr Cys Leu Ala Ser Lys Asp Val Met Pro Phe Asp Leu Leu Lys 
325 330 335 

Thr Lys Leu Met Leu Arg Leu Ser Thr Leu Gly He Arg He Leu Lys 
340 345 350 



Thr Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro 
355 



360 365 



Asn 

370 



Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val 



375 380 



His Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro 
385 390 395 400 

Lys Tyr Ala Ser Val He Ala Glu He Leu Arg Glu Glu Thr Thr Lys 
7 405 410 415 

He Asn Ser Asn He Ser Arg Gin Ala Trp Asp Thr Leu Trp Pro 



Gin 



420 



425 430 



Pro Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu Phe Gly Leu Ala Leu 
435 440 445 

He Val Gin Phe Aso Thr Glu Gly He Arg Ser Phe Phe Arg Thr Phe 
450 455 460 

Phe Arg Leu Pro Lys Trp Met Trp Gin Gly Phe Leu Gly Ser Thr Leu 
465 470 475 480 

Thr Ser Gly Asp Leu Val Leu Phe Ala Leu Tyr Met Phe Val He Ser 
485 490 495 
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Pro Asn Asn 



Leu Arg Lys Gly Leu He Asn His Leu lie Ser Asp Pro 
500 505 5X0 



Thr Gly Ala 
515 



Thr Met He Lys Thr Tyr Leu Lys Val 
520 



<210> 4 
<211> 1848 
<212> DNA 

<213> Adonis palaestina 
<400> 4 

gagagaaaaa gagtgttata ttaatgttac tgtcgcattc ttgcaacaca tattcagact 6 0 
ccattttctt gttttctctt caaaacaaca aactaatgtg acggagtatc tagctatgga 120 
actacttggt gttcgcaacc tcatctcttc ttgccctgtc tggacttttg gaacaagaaa 180 
ccttagtagt tcaaaactag cttataacat acatcgatat ggttcttctt gtagagtaga 240 
ttttcaagtg agggctgatg gtggaagcgg gagtagaact tctgttgctt ataaagaggg 3 00 
ttttgtggac gaggaggatt ttatcaaagc tggtggttct gagcttttgt ttgtccaaat 3 60 
gcagcaaaca aagtctatgg agaaacaggc caagctcgcc gataagttgc caccaatacc 420 
tttcggagaa tctgtgatgg acttggttgt aataggttgt ggacctgctg gtctttcact 480 
ggctgcagaa gctgctaagc taggcttgaa agttggcctt attggtcctg atcttccttt 540 
tacaaataat tatggtgtgt gggaagacga gttcaaagat cttggacttg aacgttgtat 600 
cgagcatgct tggaaggaca ccatcgtata tcttgacaat gatgctcctg tccttattgg 660 
tcgtgcatat ggacgagtta gccggcattt gctgcatgaa gagttgctga aaaggtgtgt 720 
cgagtcaggt gtatcatatc tgaattctaa agtggaaagg atcactgaag ctggtgatgg 780 
ccatagtctt gtagtttgtg aaaacgacat ctttatccct tgcaggcttg ctactgttgc 840 
atctggagca gcttcaggga aacttttgga gtatgaagta ggtggccctc gtgtttgtgt 900 
ccaaactgct tatggtgtgg aggttgaggt ggagaacaat ccatacgatc ccaacttaat 960 
ggtatttatg gactacagag actatatgca acagaaatta cagtgctcgg aagaagaata 1020 
tccaacattt ctctatgtca tgcccatgtc gccaacaaga cttttttttg aggaaacctg 1080 
tttggcctca aaagatgcca tgcctttcga tctactgaag agaaaactaa tgtcacgatt 1140 
gaagactctg ggtatccaag ttacaaaaat ttatgaagag gaatggtctt atattcctgt 1200 
tgggggttct ttaccaaaca cagagcaaaa gaacctagca tttggtgctg cagcaagcat 1260 
ggtgcatcca gcaacaggct attcggttgt acgatcacta tcagaagctc caaaatatgc 13 20 
ttctgtaatt gcaaagattt tgaagcaaga taactctgca tatgtggttt ctggacaaag 13 80 
cagtgcagta aacatttcaa tgcaagcatg gagcagtctt tggccaaagg agcgaaaacg 1440 
tcaaagagca ttctttcttt tcgggttaga gcttattgtg cagctagata ttgaagcaac 1500 
cagaacgttc tttagaacct tcttccgctt gccaacttgg atgtggtggg gtttccttgg 1560 
gtcttcacta tcatctttcg atcttgtatt gttttccatg tacatgtttg ttttggcccc 1620 
gaacagcatg aggatgtcac ttgtgagaca tttgctttca gatccttctg gtgcagttat 1680 
ggttaaagct tacctcgaaa ggtaatctgt tttatgaaac tatagtgtct cattaaataa 1740 
atgaggatcc ttcgtatatg tatatgatca tctctatgta tatcctatat tctaatctca 1800 
taaagtaatc gaaaattcat tgatagaaaa aaaaaaaaaa aaaaaaaa 1848 

<210> 5 
<211> 529 
<212> PRT 

<213> Adonis palaestina 



<400> 5 

Met Glu Leu Leu Gly Val Arg Asn Leu He Ser Ser Cys Pro Val Trp 
1 .5 10 15 



SUSSTITUT1 SHEET (ROLE 28) 
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Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn lie 
20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg val Asp Phe Gin Val Arg Ala Asp 
35 40 45 

Gly Gly Ser Gly Ser Arg Thr Ser Val Ala Tyr Lys Glu Gly Phe Val 
50 55 60 

Asp Glu Glu Asp Phe He Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 



65 



70 75 80 



Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 

85 90 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
X00 105 11° 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 I 25 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 I* 0 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 160 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 175 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 I 90 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asn ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 

Leu Val val Cys Glu Asn Asp He Phe He Pro Cys Arg Leu Ala Thr 
225 230 235 240 

Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Gly Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 
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Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 J 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys He 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 ' ' 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 " 390 395 400 

Tyr Ala Ser Val He Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Leu 
435 440 445 

Phe Gly Leu Glu Leu He Val Gin Leu Asp lie Glu Ala Thr Arg Thr 
450 455 460 

Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly Phe 
465 470 475 480 

Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met Tyr 
485 490 495 

Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg His 
500 505 510 

Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Lys Ala Tyr Leu Glu 
515 520 525 

Arg 



<210> 6 
<211> 530 
<212> PRT 

<213> Adonis palaestina 
<400> 6 

Met Glu Leu Leu Gly Val Arg Asn Leu He Ser Ser Cys Pro Val Trp 
1 5 10 15 
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Thr Phe Gly Thr Arg Asn Leu Ser Ser Ser Lys Leu Ala Tyr Asn He 

20 25 30 

His Arg Tyr Gly Ser Ser Cys Arg Val Asp Phe Gin Val Arg Ala Asp 

35 40 45 

Gly Gly Ser Gly Ser Arg Ser Ser Val Ala Tyr Lys Glu Gly Phe Val 

50 55 60 



Asp 
65 



Glu Glu Asp Phe He Lys Ala Gly Gly Ser Glu Leu Leu Phe Val 

70 75 80 



Gin Met Gin Gin Thr Lys Ser Met Glu Lys Gin Ala Lys Leu Ala Asp 

85 90 95 

Lys Leu Pro Pro He Pro Phe Gly Glu Ser Val Met Asp Leu Val Val 
100 105 HO 

He Gly Cys Gly Pro Ala Gly Leu Ser Leu Ala Ala Glu Ala Ala Lys 
115 120 125 

Leu Gly Leu Lys Val Gly Leu He Gly Pro Asp Leu Pro Phe Thr Asn 
130 135 140 

Asn Tyr Gly Val Trp Glu Asp Glu Phe Lys Asp Leu Gly Leu Glu Arg 
145 150 155 l eo 

Cys He Glu His Ala Trp Lys Asp Thr He Val Tyr Leu Asp Asn Asp 
165 170 I 75 

Ala Pro Val Leu He Gly Arg Ala Tyr Gly Arg Val Ser Arg His Leu 
180 185 190 

Leu His Glu Glu Leu Leu Lys Arg Cys Val Glu Ser Gly Val Ser Tyr 
195 200 205 

Leu Asp Ser Lys Val Glu Arg He Thr Glu Ala Gly Asp Gly His Ser 
210 215 220 



Leu 
225 



Val Val Cys Glu Asn Glu He Phe He Pro Cys Arg Leu Ala Thr 
230 235 240 



Val Ala Ser Gly Ala Ala Ser Gly Lys Leu Leu Glu Tyr Glu Val Gly 
245 250 255 

Glv Pro Arg Val Cys Val Gin Thr Ala Tyr Gly Val Glu Val Glu Val 
260 265 270 

Glu Asn Asn Pro Tyr Asp Pro Asn Leu Met Val Phe Met Asp Tyr Arg 
275 280 285 

Asp Tyr Met Gin Gin Lys Leu Gin Cys Ser Glu Glu Glu Tyr Pro Thr 
290 295 300 
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Phe Leu Tyr Val Met Pro Met Ser Pro Thr Arg Leu Phe Phe Glu Glu 
305 310 315 320 

Thr Cys Leu Ala Ser Lys Asp Ala Met Pro Phe Asp Leu Leu Lys Arg 
325 330 335 

Lys Leu Met Ser Arg Leu Lys Thr Leu Gly He Gin Val Thr Lys Val 
340 345 350 

Tyr Glu Glu Glu Trp Ser Tyr He Pro Val Gly Gly Ser Leu Pro Asn 
355 360 365 

Thr Glu Gin Lys Asn Leu Ala Phe Gly Ala Ala Ala Ser Met Val His 
370 375 380 

Pro Ala Thr Gly Tyr Ser Val Val Arg Ser Leu Ser Glu Ala Pro Lys 
385 390 395 400 

Tyr Ala Ser Val He Ala Lys He Leu Lys Gin Asp Asn Ser Ala Tyr 
405 410 415 

Val Val Ser Gly Gin Ser Ser Ala Val Asn He Ser Met Gin Ala Trp 
420 425 430 

Ser Ser Leu Trp Pro Lys Glu Arg Lys Arg Gin Arg Ala Phe Phe Thr 
435 440 445 

Leu Phe Gly Leu Glu Leu He Val Gin Leu Asp He Glu Ala Thr Arg 
450 455 460 

Thr Phe Phe Arg Thr Phe Phe Arg Leu Pro Thr Trp Met Trp Trp Gly 
465 470 475 480 

Phe Leu Gly Ser Ser Leu Ser Ser Phe Asp Leu Val Leu Phe Ser Met 
485 490 495 

Tyr Met Phe Val Leu Ala Pro Asn Ser Met Arg Met Ser Leu Val Arg 
500 505 510 

His Leu Leu Ser Asp Pro Ser Gly Ala Val Met Val Arg Ala Tyr Leu 
515 520 525 

Glu Arg 
530 
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